首页> 外文OA文献 >Learning Macromanagement in StarCraft from Replays using Deep Learning
【2h】

Learning Macromanagement in StarCraft from Replays using Deep Learning

机译:使用深度学习从重播中学习星际争霸中的宏观管理

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The real-time strategy game StarCraft has proven to be a challengingenvironment for artificial intelligence techniques, and as a result, currentstate-of-the-art solutions consist of numerous hand-crafted modules. In thispaper, we show how macromanagement decisions in StarCraft can be learneddirectly from game replays using deep learning. Neural networks are trained on789,571 state-action pairs extracted from 2,005 replays of highly skilledplayers, achieving top-1 and top-3 error rates of 54.6% and 22.9% in predictingthe next build action. By integrating the trained network into UAlbertaBot, anopen source StarCraft bot, the system can significantly outperform the game'sbuilt-in Terran bot, and play competitively against UAlbertaBot with a fixedrush strategy. To our knowledge, this is the first time macromanagement tasksare learned directly from replays in StarCraft. While the best hand-craftedstrategies are still the state-of-the-art, the deep network approach is able toexpress a wide range of different strategies and thus improving the network'sperformance further with deep reinforcement learning is an immediatelypromising avenue for future research. Ultimately this approach could lead tostrong StarCraft bots that are less reliant on hard-coded strategies.
机译:实时战略游戏《星际争霸》(StarCraft)已证明是人工智能技术面临的挑战性环境,因此,当前的最新解决方案由众多手工制作的模块组成。在本文中,我们展示了如何使用深度学习直接从游戏重播中学习《星际争霸》中的宏管理决策。对从高技能玩家的2,005次重播中提取的789,571个状态-动作对进行了神经网络训练,在预测下一个构建动作时,前1个和前3个错误率分别为54.6%和22.9%。通过将训练有素的网络集成到开放源代码StarCraft机器人UAlbertaBot中,该系统可以大大胜过游戏内置的Terran机器人,并通过固定的抢先策略与UAlbertaBot竞争。据我们所知,这是第一次从StarCraft中的重放直接学习宏管理任务。虽然最佳的手工制定策略仍是最新技术,但深度网络方法能够表达各种不同的策略,因此通过深度强化学习进一步改善网络的性能是未来研究的直接途径。最终,这种方法可能导致强大的StarCraft机器人较少依赖硬编码策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号